Goto

Collaborating Authors

 linear bandit problem







TightFirst-andSecond-OrderRegretBounds forAdversarialLinearBandits

Neural Information Processing Systems

In addition, we need only assumptions weaker than those of existing algorithms; our algorithms work on discrete action sets as well as continuous ones without apriori knowledge about losses, and theyrun efficiently ifalinear optimization oracle for the action set is available.




We thank the reviewer for the positive comments and encouraging us to point out the novelty in our

Neural Information Processing Systems

We thank the reviewers for providing detailed and quality reviews. A final novelty in our analysis is obtaining tighter bounds on the problem-dependent terms. We appreciate you pointing out several typos that we are fixing. Thanks for the detailed comments on the proof of Theorem 1. X and the index j is with respect to the m elements in Z . Thank you for the careful review and giving several useful suggestions.